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1. INTRODUCTION 


PMX1-A2 and CLX2 both support multiple region headers per region. This feature allows 
the following features to operate with reduced performance impact, 


e Opaque tri-linear texture filtering 
e Opaque multi-texturing 
e Modifier volumes 


In addition, modifier volume functionality can be enhanced to allow multiple interacting 
volume effects without modifications to base geometry. 


This document aims to demonstrate how multiple region headers can be used to 
implement/enhance these and other features. 


2. HARDWARE INTERFACE 


The hardware has been modified to allow each region to have any number of region headers 
specified for it. In addition to this the region header control word has been modified to allow 
control of, 


e Z State clearing 
e Accumulation buffer flushing 
e Translucency sort mode 


Region headers for any given region should be supplied sequentially e.g, 


Header 0 Header 1 Header 0 Header | 
Clear Depth Flush Acc, ; Clear Depth Flush Acc, ; 
Keep Depth. : Keep Depth.| ; Etc 


Region 0 ; Region | 


Note, generally for each region the first header will clear the ISP’s depth state and the last 
will flush the contents of the accumulation to the frame store. 


3. USE EXAMPLES 


The following examples are intended to demonstrate the use of multiple region headers, as 
such they are not optimised for the performance of any specific API. 
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3.1 Tri-linear Texture Filtering 


PVR2 performs tri-linear texture filtering as a two pass process i.e. two objects must be 
passed through the ISP in order to produce a single tri-linear filtered object. 


3.1.1 Why does this hit performance ? 


To get the ISP to pass both objects through to the TSP the second must be placed in the 
translucent object list irrespective of its opaque or translucent nature. This means, when tri- 
linear filtering is enabled, opaque objects will have the same per object hit on the fill rate as 
translucent ones. i.e. N opaque objects can cost up to N+1 (pre-sort mode) or N’+1 hardware 
passes (full sort mode). This compares to just one irrespective of the number of objects when 
bilinear filtering. 


3.1.2 How do multiple region headers solve this ? 


Using a second region header per region allows the second pass of the opaque tri-linear 
filtered objects to be placed in a separate opaque list. The fact that the objects for the second 
pass are contained in a separate opaque list means that the total cost of opaque tri-linear 
filtering is only two hardware passes irrespective of the number of polygons present. 


3.1.3 Limitations. 


Unfortunately the same optimisation cannot be applied to translucent tri-linear filtering, as 
each pass must immediately proceed the previous one, each set of combined object passes 
must be alpha blended into the primary accumulation buffer individually. Note, CLX2’s fast 
translucency pass must be treated in the same way as full blown translucency. 


3.1.4 Required Region Headers 
To implement the above each region would require two region headers. The first would 
contain the first pass of all opaque tri-linear filtered objects, plus all other opaque objects e.g. 


non textured, textured bilinear etc. The second header would contain all opaque tri-linear pass 
two objects plus ALL translucent objects, as demonstrated below. 
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Opaque data Pointer 


Opaque Pass | List for : 


Region N, Tri-Lin Pass A 
Header 0 EMPTY ri-Lin Pass 
EMPTY and — 
EMPTY Non Tri-lin 
Note, SM = Sort 
Opaque Pointer List Mode i.e Pre or 
for : Full Sort 
; pul, Reap 2, Busine Tri-Linear Pass B 
Region N, Opaque Pass 2 
Header 1 


EMPTY 
Translucent Pointer 
List for : 
EMPTY - 
| —SCsEMPTY qin Pace Ae 
@ and 


Non Tri-Lin 


3.2 Multi-Texturing 


PVR2 requires each texture layer to be sent through the ISP as a separate object, without 
multiple region headers this will have the same impact on opaque performance as in the tri- 
linear case. However this hit is multiplied by the number of texture layers on each object. 


3.2.1 Use of Multiple Region Headers 
For opaque objects each texture layer should be inserted into a separate region header. For N 


texture layers N+4 region headers would be required per region. These would take the 
following form (passes only represented in order to clarify diagram), 


Tex 0 Opaque Tri-Lin Pass A 
-f 


Non multi-textured opaque 


* Tex 0 Opaque Tri-Lin Pass B 
a 


Tex | Opaque 


Opaque Specular Highlights 
Opaque Fogging 


Opaque to Primary Acc Buffer 


All Trans passes 


* All passes other than the first should retain the Z state of the previous pass. 
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Note that as per the tri-linear example all the translucent layers are inserted into the last 
region header (layers are as per opaque). 


3.3. Cheap and Flexible Volume Effects 


One of the primary criticism’s of PVR2’s modifier volumes is that they are extremely 
expensive to use for even simple effects such as shadow and light volumes as double the 
amount of material data has to be supplied. This is further compounded when multiple region 
headers are used, say, for multi-texturing, as all volumes must be processed within each 
region header e.g five region headers require all volumes to be processed five times. 


3.3.1 Reactive Volume Effects 


¢é It is possible to apply a “reactive” effects to all front most pixels within a scene by inserting 
additional region header(s) that contain a single full screen modifier object plus the volumes 
that the effect should be applied too. Note, the modifier geometry can be as complex as you 
require, a single object is simply the cheapest way to achieve a global effect. 


The effect object MUST have its depth compare mode set to always and Depth Write 
disabled, this has the effect of replacing the front most ISP object TAG with that of the effect 
object without destroying the depth value. When the hardware processes the volumes 
contained with the pass (region header) the TSP will simply apply the material and blending 
of the effect object based on the inside outside information to the current contents of the 
accumulation buffer. E.g. to add a global reactive effect to both opaque and translucent data, 


All Opaque and translucent region 
headers (last should not flush to 
acc) 


List of full screen 
modifier (i.e. have 
inside and outside 
materials) objects 
that combine to 
give the desired 
effect inside the 
following 
volumes. 


3.3.2 Multiple Interacting Volumes 


Pre sort, Flush Acc, Keep Z 


As the modifier effect is specified in a separate region header any number of differing effects 
can be applied across multiple volumes. e.g. Soft edged effects can be created by supplying 
multiple concentric volumes in separate region headers. This results in the effect supplied in 
the first in the sequence of headers be applied repeatedly in an ever increasing or decreasing 
volumes (the later for shadow or light volumes). 
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3.3.3 Limitations 


The method described above does not allow volume effects to be independently applied to 
both opaque and translucent objects, as the modifier object globally replaces the front most 
ISP object TAG. As such the effect may only by applied correctly to opaque only or opaque 
and translucent together. i.e. Either the effects are visible through but not present on or 
present on but not visible through translucent objects. 


This limitation can be overcome by either handling translucency and volumes as originally 
specified by PVR2 1.e. supply inside/outside materials for all objects (doesn’t allow multiple 
interacting volumes). Or by resetting the opaque tags after having applied the effects to them 
(reset to non-modifier object) and repeating all translucent geometry with its material data 
replaced with the appropriate modifier data. 


3.4 Ordering Dependencies 


Occasionally an application wishes to clear an area of the Z buffer without effecting the 
contents of the frame store. As PVR2 imposes a strict split between opaque and translucent 
objects within each region header, this effect isn’t possible with a single region header per 
region. 


This can be overcome by inserting the “offending” translucent object(s) into the current 
header and then “starting” a new region header. The driver would then carry on building the 
region objects lists with the new header E.g. 
Opaque 
+ 
Trans with ordering 


dependency + any trans 
that precedes any 
further opaque data. 


Opaque 
+ 
Trans 


etc 


Note, if multiple region headers are being used for, say, multi-texturing, then the whole 
header “structure” will need to be restarted. 


3.5 Punch Through Translucency 

Note, this is section applies to PMX1-A2 only as it is effectively superseded in CLX2 by the 
fast translucent/punch through pass list. 

In a conventional Z buffer based system punch through translucency doesn’t need to be 


sorted by the application. As PVR uses deferred texturing (with no feed back of CK/Alpha 
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test to ISP), all punch through textured polygons need to go through a fully sort translucent 
pass. As the majority of titles are written to run on other platforms besides PVR (particularly 
D3D apps) they tend to assume that translucent data has the same full set of Z compare 
modes as opaque data. This precludes the use of full sort translucency mode all the time. 


PMX1-A2 allows punch through translucency to be grouped into a separate pass that has full 
translucency sorting enabled, so removing the Z Mode restrictions from the true translucent 


data. 
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